test: HTTP/CLI route matrix and structured API error codes by clean6378-max-it · Pull Request #42 · cppalliance/claude-code-chat-browser

clean6378-max-it · 2026-05-21T00:55:54Z

Closes #41

Summary

Week 3 Wednesday work for claude-code-chat-browser (8 story points):

HTTP + CLI test depth (5 pt) — Full-app route coverage (test_api_routes.py), shared fixtures (tests/conftest.py), CLI subprocess e2e (test_cli_e2e.py), and search limit unit tests (test_search.py).
Structured error codes (3 pt) — ErrorCode enum + error_response() helper; every API JSON error includes "code" and "error"; README error catalog.
Canonical issues: fab633e4-4f9d-56b4-97d4-cfc84b5bbdd7 (HTTP/CLI tests) · 8c01c675-d0e7-5e5c-a179-472a2fea5056 (structured error codes)
Detail guide: Doc/Issues/chen-week3-wednesday-claude-code-chat-browser-guide.md

Changes

API

Area	Change
`api/error_codes.py`	New — `ErrorCode` (`StrEnum`) and `error_response()`
`api/search.py`	`_parse_limit()`; 400 `SEARCH_INVALID_LIMIT`; max limit 500
`api/sessions.py`	All errors use structured codes (`INVALID_PATH`, `SESSION_NOT_FOUND`, `PARSE_ERROR`, `INTERNAL_ERROR`)
`api/export_api.py`	Structured codes for invalid body/since, 422 empty export, session export errors

Tests

File	Purpose
`tests/conftest.py`	Shared `client` fixtures, `export_state_file`, `assert_error_response()`
`tests/fixtures/*.jsonl`	Minimal session seeds for API/CLI tests
`tests/test_api_routes.py`	Stats, export session/state/bulk, index smoke, path safety, parse-failure leak guard
`tests/test_cli_e2e.py`	`list`, `stats`, `export`, invalid `--since` (UTF-8 env for Windows)
`tests/test_error_codes.py`	Parametrized `"code"` field assertions
`tests/test_search.py`	Limit validation + `SEARCH_INVALID_LIMIT`
`tests/test_export_api_bulk.py`	Assert `"code"` on isolated blueprint tests

Docs

README.md — API error code table; bulk export 422 example includes "code"

Example error response

{
  "error": "Invalid limit: must be a positive integer",
  "code": "SEARCH_INVALID_LIMIT"
}

<!-- This is an auto-generated comment: release notes by coderabbit.ai -->
## Summary by CodeRabbit

* **New Features**
  * API error responses now include stable, machine-readable error codes alongside human messages.

* **Documentation**
  * Bulk Export docs expanded; new API error codes section and standardized JSON error shape documented.

* **Bug Fixes**
  * Unified, validated error handling across search, export, and session endpoints (including explicit errors for invalid parameters, path issues, parse failures, and a 422 when bulk export yields nothing).

* **Tests**
  * Added extensive unit, integration, and end-to-end tests for API, CLI, and frontend utilities.

* **Chores**
  * CI and JS test tooling configuration updates.

<!-- review_stack_entry_start -->

[![Review Change Stack](https://storage.googleapis.com/coderabbit_public_assets/review-stack-in-coderabbit-ui.svg)](https://app.coderabbit.ai/change-stack/cppalliance/claude-code-chat-browser/pull/42?utm_source=github_walkthrough&utm_medium=github&utm_campaign=change_stack)

<!-- review_stack_entry_end -->
<!-- end of auto-generated comment: release notes by coderabbit.ai -->

coderabbitai · 2026-05-21T00:56:06Z

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

@coderabbitai resume to resume automatic reviews.
@coderabbitai review to trigger a single review.

Use the checkboxes below for quick actions:

▶️ Resume reviews
🔍 Trigger review

📝 Walkthrough

Walkthrough

Centralizes API error handling with an ErrorCode enum and error_response() helper, migrates search/sessions/export endpoints to structured error codes, updates README, and adds fixtures plus extensive HTTP/CLI/JS tests and CI/tooling updates.

Changes

Structured Error Codes and Comprehensive API/CLI Test Coverage

Layer / File(s)	Summary
Error Code Foundation and Documentation `api/error_codes.py`, `README.md`	`ErrorCode` enum and `error_response(code, message, status, **extra)` helper; README documents standard error JSON (`error` + `code`) and lists common codes.
Search API Limit Validation `api/search.py`, `tests/test_search.py`	Adds `_parse_limit()` and `_DEFAULT_LIMIT`/`_MAX_LIMIT`; `/api/search` validates `limit` and returns `SEARCH_INVALID_LIMIT` (HTTP 400) via `error_response()` on invalid input. Tests cover integer, float, non-numeric, zero/negative, whitespace, omitted-limit, and empty-query behaviors.
Sessions API Error Handling `api/sessions.py`	`get_session` and `get_session_stats` use `error_response()` for `INVALID_PATH`, `SESSION_NOT_FOUND`, `PARSE_ERROR`, and `INTERNAL_ERROR`, replacing ad-hoc JSON/status responses.
Export API Error Handling `api/export_api.py`, `tests/test_export_api_bulk.py`	`bulk_export` returns `INVALID_REQUEST_BODY` (400) for non-object bodies, `INVALID_SINCE_MODE` (400) for invalid `since`, and `EXPORT_NOTHING_TO_EXPORT` (422) with echoed `since` when empty. `export_session` returns `SESSION_NOT_FOUND` and `INVALID_PATH` via `error_response()`. Tests extended to assert `code` fields.
Shared Test Fixtures and Helpers `tests/conftest.py`, `tests/fixtures/session_*.jsonl`	Adds `assert_error_response(resp, expected_code=None)`, `export_state_file` fixture, and Flask test-client fixtures (`client`, `client_single`, `client_empty`, `client_thinking`) seeded with JSONL fixtures.
Full-App HTTP Route Tests `tests/test_api_routes.py`	Comprehensive route tests covering index HTML, session stats (happy/404/invalid path), search limit capping and validation, project sessions invalid-path behavior, export state, bulk export error cases, and export-session formats and errors.
Parametrized Error Code Tests `tests/test_error_codes.py`	Parametrized matrix asserting endpoint failures return expected HTTP status and `ErrorCode` values; includes a specific 422 empty-export check for `EXPORT_NOTHING_TO_EXPORT`.
CLI End-to-End Tests `tests/test_cli_e2e.py`	End-to-end subprocess tests for `scripts/export.py`: `list` (exit 0), `list` unknown project (exit 0 with message), `stats` (exit 0), invalid `--since` (non-zero exit), `export --no-zip` writes Markdown files.
JS Unit Tests and Vitest Config `static/js/shared/*.test.js`, `vitest.config.js`, `package.json`	Adds Vitest unit tests for markdown, state, utils; `vitest.config.js` and `package.json` configure JS testing environment and coverage.
CI and Tooling `.github/workflows/ci.yml`, `.gitignore`, `pyproject.toml`, `requirements-dev.txt`	Pins GH Actions to SHAs, adds `integration-tests` CI job that runs selected pytest with coverage artifact upload, ignores coverage artifacts in `.gitignore`, and adds pytest/coverage settings via `pyproject.toml` and `requirements-dev.txt`.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Possibly related issues

Add frontend and integration test suite #39: This PR aligns with the request to add integration and frontend tests plus structured error codes and test fixtures.

Possibly related PRs

cppalliance/claude-code-chat-browser#34: Related changes to api/export_api.py bulk export handling and since validation; error-code migration applies to overlapping logic.
cppalliance/claude-code-chat-browser#6: Prior export-session behavior changes that relate to the export endpoint modifications in this PR.

Suggested labels

enhancement

Suggested reviewers

jonathanMLDev
wpak-ai
timon0305

🐰 I tidy codes with a nibble and cheer,
Errors get names so clients can hear.
Tests hop along routes, CLI in tow,
Docs list the codes, CI lets them show.
A happy rabbit dances: structured and clear.

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 16.67% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The PR title 'test: HTTP/CLI route matrix and structured API error codes' accurately summarizes the main changes: comprehensive test coverage for HTTP routes/CLI and implementation of machine-readable error codes.
Linked Issues check	✅ Passed	The PR addresses all requirements from issue `#41`: (A) HTTP/CLI test matrix via conftest, test_api_routes.py, test_cli_e2e.py, and test_search.py; (B) structured error codes via api/error_codes.py, api/search.py, api/sessions.py, api/export_api.py migrations with ErrorCode enum and error_response() helper; documentation added to README; all acceptance criteria met.
Out of Scope Changes check	✅ Passed	All changes are scoped to requirements: test infrastructure (conftest, test files, fixtures), error code implementation (api/error_codes.py and migrations), documentation (README), and workflow/config updates (CI, pyproject.toml, vitest.config.js). No unrelated refactoring, new routes, or out-of-scope features.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch test/http-cli-errors-and-codes

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 2

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

api/export_api.py (1)

287-317: ⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Guard export_session() parse/stats failures with structured error responses.

This path still has unhandled exceptions, so 500s here won’t consistently return { "error", "code" }.

Proposed fix

-    fmt = request.args.get("format", "md")
-    session = parse_session(filepath)
+    fmt = request.args.get("format", "md")
+    try:
+        session = parse_session(filepath)
+    except Exception:
+        current_app.logger.exception("Failed to parse session %s", session_id)
+        return error_response(
+            ErrorCode.PARSE_ERROR,
+            "Failed to parse session",
+            500,
+        )
+
     rules = current_app.config.get("EXCLUSION_RULES") or []
     if is_session_excluded(rules, session, project_name):
         return error_response(
             ErrorCode.SESSION_NOT_FOUND,
             "Session not found",
             404,
         )
-    stats = compute_stats(session)
+    try:
+        stats = compute_stats(session)
+    except Exception:
+        current_app.logger.exception("Failed to compute stats for %s", session_id)
+        return error_response(
+            ErrorCode.INTERNAL_ERROR,
+            "Failed to compute session stats",
+            500,
+        )
     title_slug = slugify(session["title"], default="session")

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@api/export_api.py` around lines 287 - 317, Wrap calls to
parse_session(session filepath) and compute_stats(session) in try/except blocks
inside export_session so any exceptions are caught and returned via the existing
error_response(...) shape rather than bubbling as 500 HTMLs; specifically, catch
exceptions from parse_session (and return a structured error_response with an
appropriate ErrorCode like SESSION_NOT_FOUND or a new PARSE_ERROR and a 400/404)
and catch exceptions from compute_stats (return ErrorCode.INTERNAL_ERROR or
STATS_COMPUTE_FAILED with a 500), ensuring the rest of the flow
(session_to_json/session_to_markdown and send_file) only runs when parsing and
stats succeed.

🧹 Nitpick comments (4)

tests/test_api_routes.py (1)

68-72: ⚡ Quick win

Harden capped-limit assertion with response-shape check.

This test can pass on non-list payloads (e.g., an unexpected JSON object) because it only checks len(...) <= 500. Assert list shape explicitly before length checks.

Proposed test hardening

 def test_search_limit_capped_at_max(client):
     resp = client.get("/api/search?q=Hello&limit=9999")
     assert resp.status_code == 200
-    assert len(resp.get_json()) <= 500
+    body = resp.get_json()
+    assert isinstance(body, list)
+    assert len(body) <= 500

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@tests/test_api_routes.py` around lines 68 - 72, The
test_search_limit_capped_at_max test currently only checks len(resp.get_json())
<= 500 which will pass for non-list JSON shapes; update the test to first assert
the response body is a JSON list (e.g., use resp.is_json and assert
isinstance(resp.get_json(), list) or equivalent) before asserting its length, so
the shape is validated prior to the capped-limit check in
test_search_limit_capped_at_max.

tests/test_search.py (1)

8-12: ⚡ Quick win

Tighten happy-path assertions to validate limit behavior.

These tests currently confirm status, but not enough of the response contract. Add explicit list-shape checks and verify requested/default limit constraints in assertions.

Proposed assertion upgrades

 def test_limit_integer_string(client_single):
     resp = client_single.get("/api/search?q=Hello&limit=10")
     assert resp.status_code == 200
-    assert isinstance(resp.get_json(), list)
+    body = resp.get_json()
+    assert isinstance(body, list)
+    assert len(body) <= 10
@@
 def test_limit_default(client_single):
     resp = client_single.get("/api/search?q=Hello")
     assert resp.status_code == 200
+    body = resp.get_json()
+    assert isinstance(body, list)
+    assert len(body) <= 200

Also applies to: 26-29

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@tests/test_search.py` around lines 8 - 12, Update the happy-path assertions
in test_limit_integer_string (and the similar test at lines 26-29) to assert the
response is a list, each element has the expected shape (e.g., dict with
expected keys) and that the number of returned items respects the requested or
default limit: assert isinstance(resp.get_json(), list), assert
all(isinstance(item, dict) and set(expected_keys).issubset(item.keys()) for item
in resp.get_json()), and assert len(resp.get_json()) <= requested_limit (for
"/api/search?q=Hello&limit=10" assert <= 10) and also add a case that verifies
the default limit when limit is omitted by calling the endpoint without limit
and asserting len(...) <= default_limit; use the test function names
test_limit_integer_string and the other test to locate and update the
assertions.

tests/test_cli_e2e.py (2)

50-55: ⚡ Quick win

Tighten success assertion for listed project.

On Line 54, or "Project" in proc.stdout can pass even when test-project is missing (header-only output). Assert the seeded project name directly to avoid false positives.

Proposed diff

 def test_cli_list_exits_zero(tmp_path):
     base = _seed_base_dir(tmp_path)
     proc = _run_cli(["list", "--base-dir", str(base)])
     assert proc.returncode == 0
-    assert "test-project" in proc.stdout or "Project" in proc.stdout
+    assert "test-project" in proc.stdout

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@tests/test_cli_e2e.py` around lines 50 - 55, The test
test_cli_list_exits_zero uses a loose assertion that allows header-only output
to pass; tighten it by removing the fallback check ("or 'Project' in
proc.stdout") and assert directly that the seeded project name "test-project"
appears in proc.stdout (use the existing proc from _run_cli in
test_cli_list_exits_zero to perform the check).

70-73: ⚡ Quick win

Assert the invalid --since failure reason, not only exit code.

Line 72 currently accepts any non-zero failure. Add a stderr assertion so the test proves the --since validation path specifically failed.

Proposed diff

 def test_cli_invalid_since_exits_nonzero():
     proc = _run_cli(["--since", "yesterday"])
     assert proc.returncode != 0
+    assert "--since" in proc.stderr or "invalid choice" in proc.stderr.lower()

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@tests/test_cli_e2e.py` around lines 70 - 73, The test
test_cli_invalid_since_exits_nonzero only checks a non-zero exit code; update it
to also assert that _run_cli([...])'s stderr contains the specific validation
error for the --since flag (e.g., assert "Invalid value for '--since'" in
proc.stderr or a substring like "--since" and "invalid" if exact text differs)
so the test verifies the --since validation path failed; keep using the existing
helper _run_cli to capture stdout/stderr and add the stderr assertion alongside
the returncode check.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@api/sessions.py`:
- Around line 65-75: The current single try around parse_session(filepath) and
compute_stats(session) masks parse failures as INTERNAL_ERROR; split the logic
so parse errors are caught and mapped to ErrorCode.PARSE_ERROR. Specifically,
call parse_session(filepath) in its own try/except that catches parsing
exceptions and returns error_response(ErrorCode.PARSE_ERROR, "Failed to parse
session", 400) (include session_id in logs via current_app.logger.exception),
then call compute_stats(session) in a separate try/except that logs and returns
the existing INTERNAL_ERROR via error_response(ErrorCode.INTERNAL_ERROR, "Failed
to compute session stats", 500) if compute_stats fails. Ensure you reference
parse_session, compute_stats, session_id, current_app.logger.exception,
error_response and ErrorCode.PARSE_ERROR in the change.

In `@tests/conftest.py`:
- Around line 22-24: The shared error helper in tests/conftest.py currently only
asserts "code" is present in body; add a type assertion to ensure body["code"]
is a string (e.g., use isinstance(body["code"], str)) before comparing with
expected_code in the helper that contains the assertions around "code" and
expected_code so tests fail if handlers return non-string codes.

---

Outside diff comments:
In `@api/export_api.py`:
- Around line 287-317: Wrap calls to parse_session(session filepath) and
compute_stats(session) in try/except blocks inside export_session so any
exceptions are caught and returned via the existing error_response(...) shape
rather than bubbling as 500 HTMLs; specifically, catch exceptions from
parse_session (and return a structured error_response with an appropriate
ErrorCode like SESSION_NOT_FOUND or a new PARSE_ERROR and a 400/404) and catch
exceptions from compute_stats (return ErrorCode.INTERNAL_ERROR or
STATS_COMPUTE_FAILED with a 500), ensuring the rest of the flow
(session_to_json/session_to_markdown and send_file) only runs when parsing and
stats succeed.

---

Nitpick comments:
In `@tests/test_api_routes.py`:
- Around line 68-72: The test_search_limit_capped_at_max test currently only
checks len(resp.get_json()) <= 500 which will pass for non-list JSON shapes;
update the test to first assert the response body is a JSON list (e.g., use
resp.is_json and assert isinstance(resp.get_json(), list) or equivalent) before
asserting its length, so the shape is validated prior to the capped-limit check
in test_search_limit_capped_at_max.

In `@tests/test_cli_e2e.py`:
- Around line 50-55: The test test_cli_list_exits_zero uses a loose assertion
that allows header-only output to pass; tighten it by removing the fallback
check ("or 'Project' in proc.stdout") and assert directly that the seeded
project name "test-project" appears in proc.stdout (use the existing proc from
_run_cli in test_cli_list_exits_zero to perform the check).
- Around line 70-73: The test test_cli_invalid_since_exits_nonzero only checks a
non-zero exit code; update it to also assert that _run_cli([...])'s stderr
contains the specific validation error for the --since flag (e.g., assert
"Invalid value for '--since'" in proc.stderr or a substring like "--since" and
"invalid" if exact text differs) so the test verifies the --since validation
path failed; keep using the existing helper _run_cli to capture stdout/stderr
and add the stderr assertion alongside the returncode check.

In `@tests/test_search.py`:
- Around line 8-12: Update the happy-path assertions in
test_limit_integer_string (and the similar test at lines 26-29) to assert the
response is a list, each element has the expected shape (e.g., dict with
expected keys) and that the number of returned items respects the requested or
default limit: assert isinstance(resp.get_json(), list), assert
all(isinstance(item, dict) and set(expected_keys).issubset(item.keys()) for item
in resp.get_json()), and assert len(resp.get_json()) <= requested_limit (for
"/api/search?q=Hello&limit=10" assert <= 10) and also add a case that verifies
the default limit when limit is omitted by calling the endpoint without limit
and asserting len(...) <= default_limit; use the test function names
test_limit_integer_string and the other test to locate and update the
assertions.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 93da7cea-ce58-4a41-9562-246a7f450f6a

📥 Commits

Reviewing files that changed from the base of the PR and between 4bbb456 and 20ac332.

📒 Files selected for processing (13)

README.md
api/error_codes.py
api/export_api.py
api/search.py
api/sessions.py
tests/conftest.py
tests/fixtures/session_minimal.jsonl
tests/fixtures/session_with_tools.jsonl
tests/test_api_routes.py
tests/test_cli_e2e.py
tests/test_error_codes.py
tests/test_export_api_bulk.py
tests/test_search.py

coderabbitai

Actionable comments posted: 3

♻️ Duplicate comments (1)

api/sessions.py (1)

65-75: ⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Split parse and compute error handling in stats endpoint.

At Line 66 and Line 67, parse and stats computation share one except, so parse failures are still returned as INTERNAL_ERROR instead of PARSE_ERROR.

Suggested fix

-    try:
-        session = parse_session(filepath)
-        stats = compute_stats(session)
-        return jsonify(stats)
-    except Exception:
-        current_app.logger.exception("Failed to compute stats for %s", session_id)
-        return error_response(
-            ErrorCode.INTERNAL_ERROR,
-            "Failed to compute session stats",
-            500,
-        )
+    try:
+        session = parse_session(filepath)
+    except Exception:
+        current_app.logger.exception("Failed to parse session %s", session_id)
+        return error_response(
+            ErrorCode.PARSE_ERROR,
+            "Failed to parse session",
+            500,
+        )
+
+    try:
+        stats = compute_stats(session)
+        return jsonify(stats)
+    except Exception:
+        current_app.logger.exception("Failed to compute stats for %s", session_id)
+        return error_response(
+            ErrorCode.INTERNAL_ERROR,
+            "Failed to compute session stats",
+            500,
+        )

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@api/sessions.py` around lines 65 - 75, The endpoint currently wraps
parse_session(filepath) and compute_stats(session) in a single try/except so
parse failures are treated as INTERNAL_ERROR; split them into two blocks: call
parse_session(filepath) inside its own try/except and on parse failure log via
current_app.logger.exception("Failed to parse session %s", session_id) and
return error_response(ErrorCode.PARSE_ERROR, "Failed to parse session", 400)
(catch the specific parse exception class if available, e.g., ParseError or
ValueError), then call compute_stats(session) in a separate try/except that logs
via current_app.logger.exception("Failed to compute stats for %s", session_id)
and returns the existing INTERNAL_ERROR response on other failures.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@api/export_api.py`:
- Around line 286-295: Wrap the calls to parse_session(filepath) and
compute_stats(session) in try/except blocks so parsing or stats errors return
structured error_response objects (use the existing error_response and ErrorCode
enums) instead of raising uncaught exceptions; specifically, catch exceptions
from parse_session in the block where parse_session is invoked (before
is_session_excluded) and return a clear error_response (e.g.,
ErrorCode.SESSION_NOT_FOUND or a validation error) with a short message
including the exception text and an appropriate HTTP status, and catch
exceptions from compute_stats(session) where compute_stats is called and return
error_response(ErrorCode.INTERNAL_ERROR, "Failed to compute stats: <error>",
500) so all failures yield stable responses containing the expected code field.

In `@static/js/shared/state.test.js`:
- Around line 5-10: The test suite currently only resets the shared singleton in
an afterEach block, which leaves the first test vulnerable to prior mutations;
add a beforeEach that performs the same resets as the existing afterEach (reset
state.currentProject, state.cachedSessions, state.projectDisplayNames,
state.navInProgress) so each test starts with a deterministic state; locate the
existing afterEach in the file (and the shared state object reference named
state) and duplicate its reset logic inside a beforeEach at the top of the
suite.

In `@tests/test_cli_e2e.py`:
- Around line 70-73: The test test_cli_invalid_since_exits_nonzero is
under-specified because it calls _run_cli(["--since", "yesterday"]) without a
subcommand so a non-zero exit could be from argument shape errors rather than
--since validation; update the test to invoke the CLI with a real subcommand
(use whatever existing subcommand your CLI expects when exercising --since) via
_run_cli, assert proc.returncode != 0, and also assert proc.stderr contains the
specific validation error text related to --since (or a known error fragment) to
ensure the failure is the intended validation path.

---

Duplicate comments:
In `@api/sessions.py`:
- Around line 65-75: The endpoint currently wraps parse_session(filepath) and
compute_stats(session) in a single try/except so parse failures are treated as
INTERNAL_ERROR; split them into two blocks: call parse_session(filepath) inside
its own try/except and on parse failure log via
current_app.logger.exception("Failed to parse session %s", session_id) and
return error_response(ErrorCode.PARSE_ERROR, "Failed to parse session", 400)
(catch the specific parse exception class if available, e.g., ParseError or
ValueError), then call compute_stats(session) in a separate try/except that logs
via current_app.logger.exception("Failed to compute stats for %s", session_id)
and returns the existing INTERNAL_ERROR response on other failures.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: b3934581-cfaf-4dc7-841d-4cc84fc0bd73

📥 Commits

Reviewing files that changed from the base of the PR and between 20ac332 and b2f1331.

⛔ Files ignored due to path filters (1)

package-lock.json is excluded by !**/package-lock.json

📒 Files selected for processing (24)

.github/workflows/ci.yml
.gitignore
README.md
api/error_codes.py
api/export_api.py
api/search.py
api/sessions.py
package.json
pyproject.toml
requirements-dev.txt
static/js/shared/markdown.test.js
static/js/shared/state.test.js
static/js/shared/utils.test.js
tests/conftest.py
tests/fixtures/session_minimal.jsonl
tests/fixtures/session_with_thinking.jsonl
tests/fixtures/session_with_tools.jsonl
tests/test_api_integration.py
tests/test_api_routes.py
tests/test_cli_e2e.py
tests/test_error_codes.py
tests/test_export_api_bulk.py
tests/test_search.py
vitest.config.js

✅ Files skipped from review due to trivial changes (4)

package.json
pyproject.toml
.gitignore
README.md

coderabbitai

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@api/sessions.py`:
- Around line 67-73: The except block that catches parser exceptions (in the
session handling function around current_app.logger.exception("Failed to parse
session %s", session_id)) returns ErrorCode.PARSE_ERROR with HTTP 400; change
the response status to 500 to match get_session's PARSE_ERROR handling and
maintain a consistent server-error contract. Update the error_response call in
that except handler to use a 500 status, and apply the same change to the
mirrored export-session parse handling so both endpoints return PARSE_ERROR as a
500 server error.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 8da4a479-8f9a-42d1-9a60-9ef18f6f6cce

📥 Commits

Reviewing files that changed from the base of the PR and between b2f1331 and 5b3f6b6.

📒 Files selected for processing (7)

api/export_api.py
api/sessions.py
tests/conftest.py
tests/test_api_routes.py
tests/test_cli_e2e.py
tests/test_error_propagation.py
tests/test_search.py

timon0305 · 2026-05-21T16:58:32Z

+    status: int,
+    **extra: object,
+) -> tuple[Response, int]:
+    body: dict[str, object] = {"error": message, "code": str(code)}


ErrorCode is StrEnum, so code is already a str subclass. str(code) works but code.value or just code is more idiomatic — both yield "SEARCH_INVALID_LIMIT" directly. Same nit applies to expected_code=str(code) in tests/test_error_codes.py (in the parametrized assertion at the bottom of the file). Optional.

timon0305 · 2026-05-21T17:00:17Z

+
+
+@pytest.fixture
+def client(tmp_path, export_state_file):


Four near-identical client fixtures (client, client_single, client_empty, client_thinking) repeat the same mkdir / copy / create_app / TESTING boilerplate. Same DRY concern I flagged on PR #40 — still not addressed. Worth collapsing to a parametric helper that takes a list of (fixture_file, output_name) tuples and returns a client. Recommend: a _make_client(tmp_path, export_state_file, seeds: list[tuple[str, str]]) private helper + 4 thin fixture wrappers around it.

timon0305 · 2026-05-21T17:01:12Z

+        globalThis.marked = {
+            parse: vi.fn((text) => `<p>${text}</p>`),
+        };
+        globalThis.DOMPurify = {


The mock replaces DOMPurify.sanitize with a regex that strips <script>, then the test asserts the output doesn't contain <script>. Same circular-mock issue I flagged on PR #40 — still not addressed.

timon0305 · 2026-05-21T17:02:25Z

+                404,
+            )
        return jsonify(session)
    except Exception:


Three except Exception: blocks catch too broadly. Same narrowing recommendation as before — (json.JSONDecodeError, KeyError, ValueError, OSError, FileNotFoundError).

Same as line 67, 78.

timon0305 · 2026-05-21T17:03:34Z

Four broad except Exception: sites — the two inside the export loops (162, 224) continue silently and the two below (289, 309) wrap parse/stats failures.

Same as 224, 289, 309

timon0305 · 2026-05-21T17:04:49Z

@@ -226,14 +236,11 @@ def bulk_export():
        _write_state(new_sessions_map, count)

    if count == 0:
-        return (
-            jsonify(
-                {
-                    "error": "Nothing to export",
-                    "since": since,
-                }
-            ),
+        return error_response(
+            ErrorCode.EXPORT_NOTHING_TO_EXPORT,
+            "Nothing to export",


Reordering to if count == 0: return ... first (then write state for the count > 0 path) reads more linearly. Cosmetic.

timon0305 · 2026-05-21T17:06:32Z

+def test_cli_list_unknown_project_exits_zero_with_message(tmp_path):
+    base = _seed_base_dir(tmp_path)
+    proc = _run_cli(["list", "--base-dir", str(base), "--project", "does-not-exist"])
+    assert proc.returncode == 0
+    assert "No projects found" in proc.stdout


Test name says "unknown project" but assertion checks for "No projects found" (about no projects at all). If --project does-not-exist is supposed to show a project-specific "no such project: does-not-exist" message, the test would currently pass even on the wrong message

timon0305 · 2026-05-21T17:07:09Z

+omit = [
+    "tests/*",
+    "utils/md_exporter.py",
+    "utils/session_stats.py",
+    "utils/json_exporter.py",
+]


The three utils/ modules omitted from coverage are domain logic, not glue — session_stats.py produces the cost_estimate_usd shown in stats endpoints, md_exporter.py is used by every bulk-export test in this PR, and json_exporter.py likewise.

coderabbitai

Actionable comments posted: 1

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

tests/test_search.py (1)

52-55: ⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Assert result-shape/cap for whitespace limit as well.

This test currently verifies only status code, so it won’t catch regressions where whitespace no longer defaults to the capped limit behavior.

Suggested test update

 def test_limit_whitespace_defaults(client_single):
     resp = client_single.get("/api/search?q=Hello&limit=%20%20%20")
     assert resp.status_code == 200
+    _assert_search_hits(resp.get_json(), max_items=50)

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@tests/test_search.py` around lines 52 - 55, The test
test_limit_whitespace_defaults only asserts status code; update it to also
assert the response result shape and that the returned hits are capped when
limit is whitespace — e.g., parse resp.json(), assert it contains a "results"
(or "hits") array and that len(resp.json()["results"]) equals the expected
default cap (or is <= the configured MAX_LIMIT), using the existing
client_single GET to "/api/search?q=Hello&limit=%20%20%20" so regressions in
whitespace limit handling are caught.

🧹 Nitpick comments (2)

api/error_codes.py (1)

27-28: ⚡ Quick win

Protect reserved response fields from accidental override.

extra currently overwrites "error"/"code" if those keys are passed, which can silently break the structured-error contract.

Proposed hardening

-    body: dict[str, object] = {"error": message, "code": str(code)}
-    body.update(extra)
+    body: dict[str, object] = {"error": message, "code": str(code)}
+    for key, value in extra.items():
+        if key in {"error", "code"}:
+            continue
+        body[key] = value
     return jsonify(body), status

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@api/error_codes.py` around lines 27 - 28, The response builder currently does
body: dict[str, object] = {"error": message, "code": str(code)} followed by
body.update(extra), which allows callers to override the reserved "error" and
"code" fields; change the update so reserved keys are protected (e.g., filter
out "error" and "code" from the extra mapping before updating or use logic that
only adds keys not in {"error","code"}) so that the variables body and extra
cannot overwrite the structured fields "error" and "code".

static/js/shared/markdown.test.js (1)

42-46: 💤 Low value

Consider adding a DOMPurify fallback test for consistency.

The suite tests the fallback when marked is unavailable, but there's no parallel test for when DOMPurify is missing. Adding one would verify that renderMarkdown handles both missing dependencies gracefully.

✨ Optional test to add

After line 46, you could add:

it('handles missing DOMPurify gracefully', () => {
    delete globalThis.DOMPurify;
    const html = renderMarkdown('Hello **world**');
    // Assert expected fallback behavior based on markdown.js implementation
    expect(html).toBeDefined();
});

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@static/js/shared/markdown.test.js` around lines 42 - 46, Add a parallel test
that verifies renderMarkdown handles missing DOMPurify similar to the existing
marked fallback test: in the test, delete globalThis.DOMPurify, call
renderMarkdown with a sample string (e.g., 'Hello **world**' or any markdown
input), and assert the returned HTML is defined or matches the expected fallback
output; reference the existing test for marked and the renderMarkdown function
to mirror setup/teardown and expectations so the suite covers both missing
dependencies.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@tests/test_cli_e2e.py`:
- Around line 23-36: The helper _run_cli currently calls subprocess.run without
a timeout which can hang CI; update _run_cli to enforce a bounded timeout by
adding a timeout argument (e.g., timeout: float = 30.0) to the function
signature or otherwise passing a fixed timeout value, and pass that timeout into
the subprocess.run(...) call so the CLI process is killed and the test fails
instead of hanging; modify references to _run_cli if you add a parameter and
ensure subprocess.run receives the timeout parameter.

---

Outside diff comments:
In `@tests/test_search.py`:
- Around line 52-55: The test test_limit_whitespace_defaults only asserts status
code; update it to also assert the response result shape and that the returned
hits are capped when limit is whitespace — e.g., parse resp.json(), assert it
contains a "results" (or "hits") array and that len(resp.json()["results"])
equals the expected default cap (or is <= the configured MAX_LIMIT), using the
existing client_single GET to "/api/search?q=Hello&limit=%20%20%20" so
regressions in whitespace limit handling are caught.

---

Nitpick comments:
In `@api/error_codes.py`:
- Around line 27-28: The response builder currently does body: dict[str, object]
= {"error": message, "code": str(code)} followed by body.update(extra), which
allows callers to override the reserved "error" and "code" fields; change the
update so reserved keys are protected (e.g., filter out "error" and "code" from
the extra mapping before updating or use logic that only adds keys not in
{"error","code"}) so that the variables body and extra cannot overwrite the
structured fields "error" and "code".

In `@static/js/shared/markdown.test.js`:
- Around line 42-46: Add a parallel test that verifies renderMarkdown handles
missing DOMPurify similar to the existing marked fallback test: in the test,
delete globalThis.DOMPurify, call renderMarkdown with a sample string (e.g.,
'Hello **world**' or any markdown input), and assert the returned HTML is
defined or matches the expected fallback output; reference the existing test for
marked and the renderMarkdown function to mirror setup/teardown and expectations
so the suite covers both missing dependencies.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 7dcff1ca-4e45-4767-8c3b-4afdf9b6a02a

📥 Commits

Reviewing files that changed from the base of the PR and between ba2117d and 6bb2245.

⛔ Files ignored due to path filters (1)

package-lock.json is excluded by !**/package-lock.json

📒 Files selected for processing (16)

README.md
api/error_codes.py
api/export_api.py
api/search.py
api/sessions.py
package.json
static/js/shared/markdown.test.js
static/js/shared/state.test.js
tests/conftest.py
tests/test_api_integration.py
tests/test_api_routes.py
tests/test_cli_e2e.py
tests/test_error_codes.py
tests/test_error_propagation.py
tests/test_export_api_bulk.py
tests/test_search.py

✅ Files skipped from review due to trivial changes (1)

README.md

…, exceptions)

Extract Flask client fixtures to tests/conftest.py, use client_single for search limit tests, and relax session count assertion to >= 1. Include utils/ in coverage (omit untested export modules), stop tracking .coverage, and align integration-tests CI with full cov scope. Co-Authored-By: Cursor <cursoragent@cursor.com>

- Add ErrorCode enum and error_response() helper; migrate api/search, api/sessions, and api/export_api error paths to include stable "code" - Validate search limit query param (400 SEARCH_INVALID_LIMIT; cap at 500) - Add tests/conftest.py, fixtures, test_api_routes, test_cli_e2e, test_error_codes, test_search; extend test_export_api_bulk for codes - Document error code catalog in README

…, exceptions)

coderabbitai Bot reviewed May 21, 2026

View reviewed changes

Comment thread api/sessions.py

Comment thread tests/conftest.py

clean6378-max-it force-pushed the test/http-cli-errors-and-codes branch from 20ac332 to b2f1331 Compare May 21, 2026 01:00

coderabbitai Bot reviewed May 21, 2026

View reviewed changes

Comment thread api/export_api.py Outdated

Comment thread static/js/shared/state.test.js

Comment thread tests/test_cli_e2e.py Outdated

coderabbitai Bot reviewed May 21, 2026

View reviewed changes

Comment thread api/sessions.py Outdated

timon0305 requested changes May 21, 2026

View reviewed changes

clean6378-max-it force-pushed the test/http-cli-errors-and-codes branch from ba2117d to 6bb2245 Compare May 22, 2026 14:27

coderabbitai Bot reviewed May 22, 2026

View reviewed changes

Comment thread tests/test_cli_e2e.py Outdated

clean6378-max-it added a commit that referenced this pull request May 22, 2026

fix: PR #42 review feedback (CLI timeout, search asserts, error codes…

fbec114

…, exceptions)

clean6378-max-it requested a review from timon0305 May 22, 2026 14:45

timon0305 approved these changes May 22, 2026

View reviewed changes

clean6378-max-it requested a review from wpak-ai May 22, 2026 14:57

clean6378-max-it and others added 6 commits May 23, 2026 01:19

test: add API integration suite, fixtures, and frontend unit tests

45bc5fd

fix: split parse/stats errors in export and stats routes; tighten tests

514567e

fix: PARSE_ERROR 500 on stats/export; tighten CLI and state tests

c2f1616

fix: PR #42 review feedback (CLI timeout, search asserts, error codes…

5daf230

…, exceptions)

clean6378-max-it force-pushed the test/http-cli-errors-and-codes branch from fbec114 to 5daf230 Compare May 22, 2026 17:44

fix: add @rollup/rollup-linux-x64-gnu for CI vitest on Linux

3ad3543

wpak-ai approved these changes May 22, 2026

View reviewed changes

wpak-ai merged commit c3d9b8b into master May 22, 2026
6 checks passed

wpak-ai deleted the test/http-cli-errors-and-codes branch May 22, 2026 18:02

coderabbitai Bot mentioned this pull request May 22, 2026

Docs/api reference and contributing #44

Merged

Conversation

clean6378-max-it commented May 21, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

API

Tests

Docs

Example error response

Uh oh!

coderabbitai Bot commented May 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reviews paused

Walkthrough

Changes

Estimated code review effort

Possibly related issues

Possibly related PRs

Suggested labels

Suggested reviewers

❌ Failed checks (1 warning)

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

clean6378-max-it commented May 21, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented May 21, 2026 •

edited

Loading